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1.  W.L.  Mills,  C.T.  Mullis,  R.A.  Roberts,  "An  Iterative  Estimation  Technique 
for  Power  Spectra  by  an  ARMA  Model,"  Proceedings  of  the  International 
Conference  on  Acoustics,  Speech  and  Signal  Processing,  Denver,  April  1980. 

Abstract :  Spectral  pole-zero  models  are  used  as  a  method  of  reducing  the 
number  of  parameters  needed  to  represent  certain  classes  of  spectra,  e.g., 
spectral  envelopes  having  deep  valleys.  He  present  here  an  iterative 
algorithm  for  Autoregressive-Moving  Average  (ARMA)  models  that  is  based 
on  a  repeated  use  of  Levinson's  algorithm  to  obtain  both  poles  and  zeros. 
The  procedure  allows  one  to  increase  the  number  of  poles  and  zeros  itera¬ 
tively,  similar  to  linear  prediction  methods.  Empirical  results  indicate 
that  a  slow  growth  of  the  orders  of  the  denominator  and  numerator  poly¬ 
nomials  usually  produces  better  ARMA  models. 


2.  M.  Arjmand,  C.T.  Mullis,  R.A.  Roberts,  "A  Modular  Hardware  Structure  for 

Digital  Filtering,"  Proceedings  of  the  International  Conference  on  Acoustics, 
Speech  and  signal  Processing,  Denver,  April  1980. 

Abstract ;  Good  digital  filter  realizations  for  hardware  implementations, 
in  narrowband  filtering  applications  should  have  low  roundoff  noise,  low 
coefficient  sensitivity,  and  freedom  from  overflow  oscillations.  The  hard¬ 
ware  implementations  presented  here  incorporate  the  above  properties  into 
a  highly  modular  structure  which  can  perform  computations  in  pipeline  fashion. 
That  is,  after  an  initial  delay  an  output  sample  is  obtained  for  each  input 
sample.  The  realization  and  implementations  discussed  here  contain  more 
multipliers  than  direct  form  realizations.  However,  by  using  shorter  word 
lengths  because  of  their  increased  performance  and  distributed  arithmetic 
implementations  (instead  of  multiplier  structures)  these  implementations 
can  have  less  total  hardware  complexity. 


3.  M.  Arjmand,  R.A.  Roberts,  "Efficient  Hardware  Implementations  for  Fixed 
Point  Digital  Filters,"  Proceedings  of  the  International  Conference  on 
Circuits  and  Systems,  Houston,  April  1980. 

Abstract;  The  synthesis  of  a  digital  filter  or  digital  signal  processor  is 
made  up  of  several  design  decisions.  From  a  given  set  of  specifications  one 
first  obtains  an  external  characterization  of  the  filter,  usually  expressed 
in  terms  of  a  transfer  function  H(z).  Given  the  transfer  function,  the  next 
step  is  to  choose  a  realization  for  the  filter  which  defines  the  internal 
structure  of  the  filter.  The  choice  of  a  realization  is  usually  based  on 
minimizing  the  effects  of  finite  register  length  which  include  roundoff  noise, 
coefficient  truncation,  overflow  oscaillations  and  zero-input  limit  cycles. 
After  synthesizing  a  realization  for  H(z)  that  optimizes  one  or  more  of  the 
finite  register  effects,  one  is  faced  with  a  hardware  implementation  decision. 
Which  hardware  implementation  is  the  most  desirable? 

A  hardware  implementation  decision  involves  the  consideration  of  several  para¬ 
meters  such  as  data  throughput,  output  signal  quality,  hardware  complexity, 
total  power  consumed,  etc.  For  a  given  output  signal  quality  hardware  struc¬ 
tures  are  often  compared  on  the  basis  of  speed  and  hardware  complexity.  How 
should  one  measure  hardware  complexity?  There  have  been  several  measures 
used  in  the  literature  .including  pin  count,  chip  count,  total  chip  cost  in 
dollars,  number  of  multipliers,  etc.  These  measures  are  relatively  crude. 
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He  are  using  as  a  measure  of  hardware  complexity  the  area  needed  to 
fabricate  the  device  in  NMOS  process.  This  measure  has  the  advantage  of 
reducing  all  filter  implementations  to  the  common  building  blocks  that 
make  up  the  filter. 

Examples  are  presented  comparing  the  use  of  distributed  arithmetic  (Peled- 
Liu)  implementations  and  more  conventional  multiplier  implementations  of 
fixed  point  digital  filters. 


4.  C.T.  Mullis , .  R.A.  Roberts,  "Normal  Realizations  of  HR  Digital  Filters," 
Proceedings  of  the  International  Conference  on  Acoustics,  Speech  and 
Signal  Processing,  Washington,  D.C.,  1979. 

Abstract :  Normal  realizations  of  IIR  Filters  possess  many  desirable  properties 
including  freedom  from  overflow  oscillations,  low  roundoff  noise  for  many 
pole-zero  locations,  low  coefficient  sensitivity,  and  invariance  of  form 
under  frequency  transformations  of  the  filter.  Given  a  state  variable 
model  {A,b,c,d}  of  a  filter,  we  define  a  normal  structure  as  a  filter  whose 
A  matrix  satisfies  AAT  =  ATA  .  This  paper  develops  design  equations  and 
formulas  for  roundoff  noise  of  normal  forms  in  terms  of  the  poles  and  zeros 
of  the  filter  transfer  function.  Comparisons  of  normal  forms  and  minimal 
noise  forms  are  given  in  geometric  terms  and  give  one  an  intuitive  under¬ 
standing  of  these  structures. 


5.  M.  Arjmand,  R.A.  Roberts,  "Reduced  Multiplier  Low  Roundoff  Noise  Digital 

Filters,"  Proceedings  of  the  International  Conference  on  Acoustics,  Speech 
and  Signal  Processing,  Washington,  D.C.,  1979. 

2 

Abstract:  Minimal  roundoff  noise  digital  filter  structures  require  (n+1) 
multiplies  per  output  sample  for  order  filters.  This  can  be  reduced 
to  approximately  4n  multiplies  if  these  optimal  forms  are  broken  into 
second-order  sections  which  are  then  connected  in  parallel  or  cascade. 

This  compares  to  3n  multiplies  for  the  usual  cascade  or  parallel  connection 
of  second-order  direct  forms.  This  paper  presents  new  reduced  multiplier 
structures  which  have  slightly  increased  roundoff  noise  over  optimal  forms 
but  can  be  orders  of  magnitude  better  (in  roundoff  noise)  than  direct  forms 
for  narrowband  filters.  Results  are  given  for  second-order  sections  with 
seven  and  eight  multiplies  per  section.  (Direct  forms  require  6  multiplies 
per  second-order  section  vis-a-vis  9  multiplies  per  second  order  section 
for  optimal  forms) .  Design  equations  are  presented  for  these  reduced  multi¬ 
plier  structures  in  terms  of  the  poles  and  zeros  of  the  desired  transfer 
f unctions . 


6.  M.  Arjmand,  R.A.  Roberts,  "On  Distributed  Arithmetic  Structures  for  Digital 
Filtering,"  Proceedings  of  the  Thirteenth  Asilomar  Conference  on  Circuits, 
Systems  and  Computers,  Monterey,  California  1979. 

Abstract :  Distributed  arithmetic  structures  are  an  alternative  hardware 
realization  to  the  use  of  conventional  multipliers  in  the  implementation 
of  digital  filters.  This  paper  compares  the  possible  methods  of  partitioning 
the  equations  of  a  second-order  digital  filter  for  a  distributed  arithmetic 


realization.  These  realizations  are  also  compared  to  a  conventional 
multiplier  structure  of  the  filter.  The  comparisons  are  based  on  two 
figures  of  merit  that  measure  the  product  of  the  speed  and  the  complexity 
of  a  given  implementation. 


M.  Arjmand,  R.A.  Roberts,  "Realizations  of  Low  Roundoff  Noise  Digital 
Filters,"  proceedings  of  the  International  Conference  on  Automatic 
Control,  Denver  1979. 

2 

Abstract:  Minimal  roundoff  noise  HR  digital  filters  require  (n+1) 
multiplies  per  output  sample  for  an  nth  order  filter.  By  optimizing 
second-order  sections  which  are  then  paralleled  or  cascaded,  one  can 
reduce  the  number  of  multiplies  to  approximately  4n.  These  realizations 
are  termed  sectional  optimal.  They  are  a  compromise  between  the  minimal 
noise  forms  and  direct  forms  which  require  approximately  3n  multiplies. 

We  present  here  explicit  design  equations  and  noise  performance  equations 
for  low  roundoff  noise  realizations  for  sectional  optimal  forms  and  other 
closely  related  low  noise  forms.  These  latter  realizations  possess 
slightly  greater  noise  than  sectional  optimal  forms  but  have  one  or  two 
less  multipliers  per  second-order  section. 


W„L.  Mills,  C.T.  Mullis,  R.A.  Roberts,  "Digital  Filter  Realizations 
Without  Overflow  Oscillations,"  IEEE  Transactions  on  Acoustics,  Speech 
and  Signal  Processing,  Vol.  ASSP-26,  No.  4,  pp.  334-338,  1978. 

Abstract :  Most  of  the  literature  dealing  with  overflow  oscillation  in 
fixed-point  arithmetic  digital  filters  has  considered  the  direct  form 
exclusively.  It  is  possible  to  eliminate  overflow  oscillations,  regardless 
of  pole  locations,  by  considering  more  general  forms.  A  sufficient  condi¬ 
tion  is  given  for  a  two's  complement  state  variable  realization  of  any 
order  to  be  free  of  overflow  oscillation.  A  simple  characterization  of 
the  condition  is  given  for  second  order  filters.  Among  those  second  order 
forms  which  meet  the  condition  are  normal  forms,  and  forms  which  minimize 
output  roundoff  noise. 


C.T.  Mullis,  R.A.  Roberts,  "On  Weak  Equivalence  of  Linear  Systems  and 
Finite  State  Systems,"  SIAM  Journal  on  Mathematical  Analysis,  Vol.  10, 
No.  3,  pp.  498-511,  1979. 

Abstract:  It  is  shown  that  finite  state  machines  exist  which  are  weakly 
equivalent  to  linear  systems,  for  some  nontrivial  definitions  of  weak 
equivalence.  Two  systems,  one  linear  with  state  space  Hn  ,  the  other 
with  a  finite  state  space,  operate  on  the  same  stationary  uncorrelated 
input  sequence  u.  The  two  systems  have  real  valued  output  sequences  y 
and  y.  Notions  of  weak  equivalence  are  formulated  which  involve  sets  of 
mixed  second  moments  of  the  input  and  two  outputs.  "Power  spectrum 
equivalence"  requires  that  E (ytYt+t^  “  E(ytYt+T)  for  all  t.  "Cross¬ 
correlation  equivalence"  requires  that  E(utyt+T)  a  E(utyt+-r)  for  *11  • 

The  interdependence  of  these  and  other  notions  of  weak  equivalence  are 
studied.  The  existence  of  weakly  equivalent  finite  state  systems  is 
constructively  demonstrated  for  a  standard  class  of  linear  systems. 


W.L.  Mills,  C.T.  Mullis,  R.A.  Roberts,  "Low  Roundoff  Noise  and  Normal 
Realizations  of  Fixed  Point  Digital  Filters,"  to  appear  in  IEEE  Trans¬ 
actions  on  Acoustics,  Speech  and  Signal  Processing. 


Abstract ;  Explicit  design  and  performance  equations  are  given  in  terms 
of  the  parameters  of  the  desired  transfer  function  H(z)  for  two  classes 
of  realizations.  These  realizations  offer  the  digital  circuit  designer 
an  alternative  to  the  usual  designs  based  on  direct  forms.  In  addition 
to  low  roundoff  noise,  these  new  realizations  offer  other  desirable 
properties  such  as  low  coefficient  sensitivities,  and  freedom  from  over¬ 
flow  oscillations.  Numerical  results  are  presented  which  detail  the 
design  process  and  verify  the  theory. 


9.  Research  Findings: 

This  research  has  been  concerned  with  the  problem  of  developing  efficient 
digital  hardware  structures  for  signal  processing.  Our  point-of-view  is  somewhat 
different  from  that  adopted  by  designers  of  these  devices  that  are  working  in 
the  industrial  sector.  They  have  taken  the  point-of-view  that  hardware  must  be 
versatile.  It  must  perform  more  than  one  signal  processing  task.  Thus  many 
functions  of  the  hardware  must  be  controlled  by  software  programs  tailored  for  the 
particular  class  of  problems.  This  kind  of  versatility  can  only  be  obtained  by 
an  increase  in  hardware  complexity  and  a  reduction  in  the  data  throughput  of 
the  device . 

He  have  chosen  to  optimize  digital  processing  structures  for  each  particular 
task  the  processor  is  asked  to  perform.  This  not  only  simplifies  the  complexity 
of  the  structure  but  also  greatly  increases  the  data  throughput  of  the  device. 

We  have  concentrated  on  two  signal  processing  tasks,  digital  filtering  and  spectral 
estimation. 

The  first  problem  we  had  to  solve  was  how  to  meaningfully  compare  hardware 
structures.  Complexity  measures  used  in  the  literature  are  based  on  various 
parameters  such  as  chip  count  or  pin  count  of  the  IC  components.  These  measures 
are  difficult  to  incorporate  into  a  theory.  We  are  using  what  we  consider  a 
more  fundamental  measure  —  namely  the  area  needed  to  fabricate  a  given  hardware 
design  in  an  nMOS  process.  This  kind  of  area  calculation  quantifies  each  hardware 
design  in  terms  of  its  most  fundamental  elements  and  thus  appears  to  be  a  good 
measure  of  hardware  complexity. 

Hardware  complexity  is,  of  course,  only  part  of  the  story.  Data  throughput 
is  another  basic  parameter  of  any  signal  processing  device.  Thus  as  a  figure 
of  merit  for  compar ing  various  hardware  designs  we  have  used: 

Figure  of  merit  »  (nMOS  chip  area)  x  (^or(j^rate> 
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We  have  used  this  figure  of  merit  in  developing  our  theory  of  efficient  hardware 
structures . 

There  are  essentially  four  parameters  that  appear  to  be  the  basis  for  under¬ 
standing  and  quantifying  competing  hardware  designs.  These  parameters  are: 

(a)  Concurrency  of  computation.  This  concept  includes  all  forms  of 
parallelism  and  pipelining  used  to  increase  data  throughput. 

(b)  Good  output  signal  quality.  Zn  any  digital  hardware  device,  finite 
register  effects  will  affect  the  output  quality  of  the  processor  adversely.  These 
effects  can  be  alleviated  by  increasing  the  word  length  of  the  digital  device. 
However,  increasing  word  lengths  increases  hardware  complexity  and  reduces  data 
throughput.  Thus  it  is  very  important  to  reduce  word  lengths  to  as  small  a 
value  as  possible.  This  implies  one  must  understand  what  realization  or  signal 
flow  graph  of  a  given  input-output  characterization  is  the  best  in  the  sense  of 
minimizing  finite  register  effects. 

(c)  Modularity  and  regularity  of  design.  This  idea  is  difficult  to  quantify 
but  intuitively  it  is  easy  to  see  why  it  is  important.  Modularity  of  design 
refers  to  a  structure  which  is  made  up  of  identical  modules  with  simple  and 
regular  connections  between  the. modules.  This  type  of  structure  offers  the  designer 
many  advantages  including: 

(1)  a  simple  design  process  for  the  input-output  specification; 

(2)  a  simple, implementation  process  for  the  actual  construction  of 
the  device; 

(3)  a  systematic  procedure  for  verifying  and  checking  the  logic  and 
interconnections  of  the  device.  This  can  be  a  very  inportant 
aspect  in  the  design  of  devices  with  a  large  number  of  elementary 
components  such  as  occurs  in  VLSI. 

<d)  Hardware  complexity.  We  measure  this  parameter  by  calculating  the 
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equivalent  chip  area  needed  to  fabricate  the  device  in  nMOS.  In  studying 
hardware  complexity  we  have  found  that  distributed  arithmetic  (see  Abstract  3 
on  Efficient  Hardware  Implementations)  often  results  in  lower  complexity  them 
the  use  of  monolithic  multipliers.  We  have  spoken  with  many  other  hardware 
designers  and  they  have  reached  similar  conclusions  on  the  desirability  of 
using  distributed  arithmetic  when  it  is  feasible  to  do  so. 

We  have  incorporated  these  ideas  in  the  search  for  good  designs  for  linear 
filtering  and  spectral  estimation.  We  believe  we  have  found  an  excellent  struc¬ 
ture  for  linear  filtering.  This  structure's  attributes  are  summarized  in 
Abstract  #2  on  "A  Modular  Hardware  Structure". 

We  have  also  considered  certain  structures  for  spectral  estimation  that 
as  are  yet  unpublished.  We  believe  we  are  close  to  some  good  structures  for 
this  task  also. 

We  have,  in  the  course  of  this  research,  developed  some  side  results  not 
precisely  in  the  main  thrust  of  the  proposal  which  will  also  prove  to  be  of 
value.  These  include: 

(a)  The  development  of  two  new  algorithms  for  ARMA  spectral  estimation 
(see  Abstract  #1) . 

(b)  The  development  of  a  multitier  lattice  structure  for  use  in  both 
linear  filtering  and  spectral  estimation. 

(c)  An  investigation  of  "block  processing"  in  linear  filtering  which  can 
be  used  to  improve  performance.  Block  processing  is  an  old  idea  in  which  one 
processes  data  in  blocks  rather  than  one  sample  at  a  time.  The  results  are  also 
produced  in  blocks.  This  seemingly  simple  idea  can  lead  to  some  dramatic  improve¬ 
ments  in  processing  performance.  This  work  has  not  yet  been  submitted  for  publication, 


